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Abstract. We introduce optimization techniques for reasoning in —a re¬ 

cently introduced family of nonmonotonic description logics whose characteriz¬ 
ing features appear well-suited to model the applicative examples naturally aris¬ 
ing in biomedical domains and semantic web access control policies. Such opti¬ 
mizations are validated experimentally on large KBs with more than 30K axioms. 
Speedups exceed 1 order of magnitude. For the first time, response times compat¬ 
ible with real-time reasoning are obtained with nonmonotonic KBs of this size. 


1 Introduction 

Recently, a new family of nonmonotonic Description Logics (DLs), called , has 
been introduced Q. It supports normality concepts NC to denote the normal/standard/ 
prototypical instances of a concept C, and prioritized defeasible inclusions (DIs) 
C D with the following meaning: “by default, the instances of C satisfy D, un¬ 
less stated otherwise”, that is, unless some higher priority axioms entail C □ in 
that case, C D is overridden. The normal/standard/prototypical instances of C are 
required to satisfy all the DIs that are not overridden in C. 

Given the negligible number of applications based on nonmonotonic logics de¬ 
ployed so far, has been designed to address real-world problems and concrete 

knowledge engineering needs. In this regard, the literature provides clear and articu¬ 
lated discussions of how nonmonotonic reasoning can be of help in important contexts 
related to the semantic web, such as biomedical ontologies 024I27II (with several appli¬ 
cations, such as literature search) and (semantic web) policy formulation Il28l . These 
and other applications are extensively discussed in Q- 

The distinguishing features in D£^’s design are: (i) adopts the simplest pos¬ 
sible criterion for overriding, that is, inconsistency with higher priority axioms; {ii) 
all the normal instances of a concept C conform to the same set of default properties, 
sometimes called prototype in the following; {Hi) the conflicts between DIs that cannot 
be resolved with priorities are regarded as knowledge representation errors and are to be 
fixed by the knowledge engineer (typically, by adding specific DIs). No traditional non¬ 
monotonic logic satisfies (f), and very few satisfy {ii) or {Hi). VCJ^ behaves very well 
on applicative examples due to the following consequences of {i)-{iii) (a comparison 
with other nonmonotonic DLs with respect to these features is summarized in Table[T]i: 

No inheritance blocking: In several nonmonotonic logics a concept with exceptional 
properties inherits none of the default properties of its superclasses. This undesirable 
phenomenon is known as inheritance blocking. 

No undesired closed-world assumption (CWA) effects: In some nonmonotonic DLs, 
an exceptional concept is shrinked to the individuals that explicitly belong to it, if any; 
hence, it may become inconsistent. 




Control on role ranges: Unlike most nonmonotonic DLs, axioms can specify 
whether a role should range only over normal individuals or not. 

Detect inconsistent prototypes: facilitates the identification of all conflicts that 

cannot be resolved with priorities (via consistency checks over normality concepts), be¬ 
cause their correct resolution is application dependent and should require human inter¬ 
vention (cf. Q Sec. 1] and Example [T]below). 

Tractability: VC^ is currently the only nonmonotonic DL known to preserve the 
tractability of all low-complexity DLs, including and DL-lite (that underly the 

OWL2-EL and OWL2-QL profiles). This opens the way to processing very large non¬ 
monotonic KBs within these fragments. 
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(*) It holds for subsumption, assertion checking, concept consistency, KB consistency. 

Table 1. Partial comparison with other nonmonotonic DLs, cf.(7l, where CIRC, DBF, AEL, TYP, 
RAT, PR stand, respectively, for Circumscribed DLs, Default DLs, Autoepistemic DLs, DLs with 
Typicality, DLs with Rational Closure, and Probabilistic DLs. 

The performance of VC^ inference has been experimentally analyzed on large KBs 
(with more than 20K concept names and over 30K inclusions). The results are promis¬ 
ing; still, as defeasible inclusions approach 25% of the KB, query response time slows 
down enough to call for improvements. In this paper, we study two optimization tech¬ 
niques to improve VC^ query response time: 

1. Many of the axioms in a large KB are expected to be irrelevant to the given query. 
We investigate the use of module extractors 026123II to focus reasoning on relevant 
axioms only. The approach is not trivial (module extractors are unsound for most 
nonmonotonic logics, including circumscription, default and autoepistemic logics) 
and requires an articulated correctness proof. 

2. We introduce a new algorithm for query answering, that is expected to exploit incre¬ 
mental reasoners at their best. Incremental reasoning is crucial as DC^’s reasoning 
method iterates consistency tests on a set of KBs with large intersections. While 
the assertion of new axioms is processed very efficiently, the computational cost 
of axiom deletion is generally not negligible. We introduce an optimistic reasoning 
method that is expected to reduce the number of deletions. 

Both optimizations are validated experimentally. Speedups exceed 1 order of magni¬ 
tude. To the best of our knowledge, this is the first time that response times compatible 
with real-time reasoning are obtained with nonmonotonic KBs of this size. 

The paper is organized as follows: Sec.|2]provides the basics of DCJ^ and illustrates 
its inferences with examples. Sections[3]and|4]introduce the two optimization methods. 



























respectively, and prove their correctness. Their experimental assessment is in Sec. |5] 
We assume the reader to be familiar with description logics, see lfT4ll for all details. The 
code and test suites are available at; http : / /goo . gl/KnM091 


2 Preliminaries 

Let VC, be any classical description logic language (see |[T4l for dehnitions), and let 
VC]^ be the extension of VC with a new concept name NC for each VC concept C. 
The new concepts are called normality concepts. 

A knowledge base is a disjoint union KB — S C\V where <S is a hnite set of 
VC^ inclusions and assertions (called strong or classical axioms) and £> is a hnite set 
of defeasible inclusions (DIs, for short) that are expressions C D where C is a VC 
concept and D a VC^ concept. If <5 = (C D), then pre(5) and con((5) denote C 
and D, respectively. Informally speaking, the set of DIs satished by all the instances of 
a normality concept NC constitute the prototype associated to C. 

DIs are prioritized by a strict partial order If (5i -< ^ 2 , then Si has higher priority 
than ( 52 . VC^ solves automatically only the conhicts that can be settled using -<; any 
other conhict shall be resolved by the knowledge engineer (typically by adding suitable 
DIs). Two priority relations have been investigated so far. Both are based on specificity: 
the specihc default properties of a concept C have higher priority than the more generic 
properties of its superconcepts (i.e. those that subsume C). The priority relation used in 
most of Gl’s examples identihes those superconcepts with strong axioms only: 

A 82 iff pre(5i) C 5 pre(( 52 ) and pre(( 52 ) pre((5i) Q (1) 

The second priority relation investigated in jTJ is 

-< 82 iff rank{ 8 i) > rank{ 82 ), ( 2 ) 

where rankf) is shown in Algorithm[T]and corresponds to the ranking function of ratio¬ 
nal closure IIIOIIII . This relation uses also DIs to determine superconcepts, so (roughly 
speaking) a DI C D —^besides dehning a default property for C —gives the specihc 

default properties of C higher priority than those of D. The advantage of this priority 
relation is that it resolves more conhicts than ([T]i; the main advantage of ([T]i is pre¬ 
dictability; e.g. the effects of adding default properties to an existing, classical KB are 
more predictable, as the hierarchy used for determining specihcity and resolving con- 
Hicts is the original, validated one, and is not affected by the new DIs (see also the 
related discussion in 113141 . that adopt ([T]i). 

The expression KB a means that a is a VC^ consequence of KB. Due to space 
limitations, we do not report the model-theoretic dehnition of jii and present only its 
reduction to classical reasoning Q. For all subsumptions and assertions a, KB a 
holds iff KB^ 1 = a, where S is the set of normality concepts that explicitly occur 
in KB U {a}, and KB^ is a classical knowledge base obtained as follows (recall that 
KB^S\J V): 


' As usual, C C 5 D means that S \= C Q D. 





Algorithm 1: Ranking function 
Input: Ontology KB = S VJT) 

Output: the function rank{-) 

1 i := -1; So--{CQD\CQr^D£ V} 

2 repeat 

3 i ■.= i + 1 

4 £i+i-={CQDe£i\SuSi\=C C_L} 

5 forall Ds.t.C QD €£i\ £i+i do 

6 assign rank{C D) := i 

7 until £i+i = £i 

8 forall C On D € £i+i do assign rank{C On D) ~ oo 

9 return rank(-) 


First, for all DIs 5 e X) and all NC G T, let: 

jNC _ |-| □ con((5)) . (3) 

The informal meaning of is: “NC’s instances satisfy <5”. 

Second, let S' denote the result of removing from the axiom set S' all the 
such that So yi. 6: 

S' S' \ I NC e r A ^0 7^ <J} • 

Third, let Ji,..., 5\x>\ be any linearization of {V, a)11 

Finally, let ICB^ = /CSj^|, where the sequence )CBf (7 = 1,2,..., \V\) is induc¬ 
tively defined as follows: 

/CSf = 5 U {NC E c I NC e r} (4) 

KBf = K.Bf_^ U I 5^ G KB, NC G T, and 

/CSf_i U {5f^} ^ NC E X}. (5) 

In other words, the above sequence starts with /CS’s strong axioms extended with the 
inclusions NC E C, then processes the DIs Si in non-increasing priority order. If Si 
can be consistently added to C’s prototype, given all higher priority DIs selected so far 
(which is verified by checking that NC E -L in line Q), then its translation is 
included in KB^ (i.e. Si enters C’s prototype), otherwise Si is discarded, and we say 
that Si is overridden in NC. 

2.1 Examples 

We start with a brief discussion of X>£^’s conflict handling. Most other logics silently 
neutralize the conflicts between nonmonotonic axioms with the same (or incompara¬ 
ble) priorities by computing the inferences that are invariant across all possible ways 

^ That is, {5i,..., 5 |-d|} = "P and for all i, j = 1,..., \'D\, if Si A Sj then i < j. 







of resolving the conflict. A knowledge engineer might solve it in favor of some of its 
possible resolutions, instead; however, if the logic silently neutralizes the conflict, then 
missing knowledge may remain undetected and unfixed. This approach may cause seri¬ 
ous problems in the policy domain; 

Example 1. Suppose that project coordinators are both administrative staff and research 
staff. By default, administrative staff are allowed to sign payments, while research staff 
are not. A conflict arises since both of these default policies apply to project coordina¬ 
tors. Formally, K.B can be formalized with: 

Admin 3has_right.Sign (6) 

PrjCrd C Admin □ Research (8) 

Research -i3has_right.Sign (7) 

Leaving the conflict unresolved may cause a variety of security problems. If project co¬ 
ordinators should not sign payments, and the default policy is open (authorizations are 
granted by default), then failing to infer -iBhasjright.Sign would improperly autho¬ 
rize the signing operation. Conversely, if the authorization is to be granted, then failing 
to prove 3has_right.Sigii causes a denial of service (the user is unable to complete a 
legal operation). To prevent these problems, makes the conflict visible by infer¬ 
ring ICB N Pr j Crd C _L (showing that Pr j Crd’s prototype is inconsistent). This can 
be proved by checking that iCB^ \= NPrjCrd C J_, where S = {NPrjCrd}. Then 
K.B^ consists of ([8]), N Pr j Crd C Pr j Crd, and the translation of (|6]l and (|2l) (none 
overrides the other because none is more specific under any of the two priorities): 


N PrjCrd 13 Admin 3 3has_right.Sign, 

N PrjCrd 3 Research 3 -i3has_right.Sign. q 

Here is another applicative example from the semantic policy domain, showing X>£^’s 
behavior on multiple exception levels. 

Example 2. We are going to axiomatize the following natural language policy: “In gen¬ 
eral, users cannot access confidential files; Staff can read confidential files; Blacklisted 
users are not granted any access. This directive cannot be overridden.” Note that each 
of the above directives contradicts (and is supposed to override) its predecessor in some 
particular case. Authorizations can be reified as objects with attributes subject (the ac¬ 
cess requestor), target (the file to be accessed), and privilege (such as read and write). 
Then the above policy can be encoded as follows: 


Staff 

3 

User 

(9) 

Blklst 

3 

Staff 

(10) 

UserReqst 


-i3privilege 

(11) 

StaffReqst 


3privilege.Read 

(12) 

BlkReq 

3 

-i3privilege 

(13) 


where BlkReq 3 3subj.Blklst, StaffReqst 3 3subj.Staff, and UserReqst 3 
3subj.User. By (|9]l, both the speciflty relations ([T]i and ^ yield (fl^ ^ (ITT]) , that is. 


(fT2l) has higher priority than (fTTI) . Let S = {NStaf fReqst}; (fT^ overrides (fTTT i in 
NStaf f Reqst (under ([T]i as well as (|2])), so ICB^ consists of: (|9l), (fTOl) . (fTSl) . plus 

NStaffReqst IZ StaffReqst 
NStaffReqst PI StaffReqst Z 3privilege.Read 

Consequently, ICB \v NStaffReqst C 3privilege.Read. Similarly, it can be veri¬ 
fied that: 

1. Normally, access requests involving confidential files are rejected, if they come 
from generic users: ICB \v NUserReqst C -iBprivilege; 

2. Blacklisted users cannot do anything by (fOb . so, in particular: 

ICB |=s NBlkReq C -iBprivilege. □ 

Some applicative examples from the biomedical domain can be found in j?! (see Exam¬ 
ples 3, 4, 10, 12, and the drug contraindication example in Appendix C). Like the above 
examples, they are all correctly solved by VC^ with both priority notions. Applicative 
examples hardly exhibit the complicated networks of dependencies between conflicting 
defaults that occur in artificial examples. Nonetheless, we briefly discuss the artificial 
examples, too, as a mean of comparing with other logics such as 025111I5L 

In several cases, e.g. examples B.4 and B.5 in ||25]| . agrees with 025111151 
under both priority relations. Due to space limitations, we illustrate only B.4. 

Example 3 (Juvenile offender). Let ICB consist of axioms (fT^ - lfTSl l where J, G, M, P 
abbreviate JuvenileOffender, GuiltyOfCrime, IsMinor and ToBePunished, re¬ 
spectively. 
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NJ Z J 

(21) 

M Z„ 

(17) 

NjnMHG Z -,p 

(22) 

G Z„ P 

(18) 

NJZM Z -,P 

(23) 


On one hand, criminals have to be punished and, on the other hand, minors cannot 
be punished. So, what about juvenile offenders? The defeasible inclusion (fThl) breaks 
the tie in favor of their being underage, hence not punishable. By setting E = {NJ}, 
priorities ([U and (|2b both return axioms (fT9l) - (l2^ as ICB^. Then, clearly, ICB^ |= 
NJ C -iP which is VC^’s analogue of the inferences of II25I11151 . 

In other cases (e.g. example B.l in 1251) VC^ finds the same conflicts as 025111151 . 
However, X>£^’s semantics signals these conflicts to the knowledge engineer whereas 
in 025111151 they are silently neutralized. 

Example 4 (Double Diamond). Let ICB be the following set of axioms: 
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P Z„ -iS 
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DIs ( l26l l and ( l27l i have incomparable priority under ([TJ and (|2]l. Consequently, it is easy 
to see that NA C S and NA C -iS are both implied by and hence the knowledge 
engineer is warned that NA is inconsistent. The same conflict is silently neutralized in 
mrnm (A’s instances are subsumed by neither S nor -iS and no inconsistency arises). 
Similarly for the incomparable DIs (l28l l and (l30l l and the related conflict. 

The third category of examples (e.g. B.2 and B.3 in Il25ll ) presents a more variegated 
behavior. In particular, ifTTIl and VC^ with priority (|2]i solve all conflicts and infer the 
same consequences; 1251 solves only some conflicts; 0 is not able to solve any conflict 
and yet it does not raise any inconsistency warning; VC^ with priority ([T]i cannot solve 
the conflicts but raises an inconsistency warning. Here, for the sake of simplicity, we 
discuss in detail a shorter example which has all relevant ingredients. 

Example 5. Let fCB be the following defeasible knowledge base: 

A B (31) A C (32) B -nC (33) 

According to priority O all DIs are incomparable. Therefore, T>C^ warns (by inferring 
NA C _L) that the conflict between NA □ C and NA C -iC cannot be solved. Note that 
131 adopts priority ([TJ, too, however according to circumscription, any interpretation 
where A’s instances are either in -iC □ B or in C is a model, so A is satisflable (the 
conflict is silently neutralized). Under priority (|2j, instead, axiom (OTJ gives (OTl and 
([32j higher priority than (l33J . Consequently, NA C C prevails over NA C -iC. In this 
case, and rational closure infer the same consequences. 

3 Relevance and modularity 

The naive construction of )CB^ must process all the axioms in ICB^u = ICBq U | 
S € V, NC G S}. Here we optimize inference by quickly discarding some of 
the irrelevant axioms in ICB^n using modularization techniques. 

Roughly speaking, the problem of module extraction can be expressed as follows: 
given a reference vocabulary Sig, a module is a (possibly minimal) subset Ai C ICB 
that is relevant for Sig in the sense that it preserves the consequences of ICB that contain 
only terms in Sig. 

The interest in module extraction techniques is motivated by several ontology en¬ 
gineering needs. We are interested in modularization as an optimization technique for 
querying large ontologies: the query is evaluated on a (hopefully much smaller) module 
of the ontology that preserves the query result (as well as any inference whose signature 
is contained in the query’s signature). 

However, the problem of deciding whether two knowledge bases entail the same 
axioms over a given signature is usually harder than standard reasoning tasks. Conse¬ 
quently deciding whether ICB' is a module of ICB (for Sig) is computationally expen¬ 
sive in general. For example, DL-Litehom complexity grows from PTIME to coNP- 
TIME-complete ll20l : for ABC, complexity is one exponential harder |[T31 . while for 
AECQXO the problem becomes even undecidable ll22l . 

In order to achieve a practical solution, a syntactic approximation has been adopted 
in 0261181 . The corrisponding algorithm T_L*-Mod(5i3, IfCB) is defined in ll^ Def. 4] 






and reported in Algorithm |2] below. It is based on the property of _L-locality and T- 
locality of single axioms (line|2]l. An axiom is local w.r.t. Sig if the substitution of all 
non-Sig terms with _L (resp. T) turns it into a tautology. 

The module extractor identifies a subset Ad C ICB of the knowledge base and a 
signature Sig (containing all symbols of interest) such that all axioms in ICB \ Ad are 
local w.r.t. Sig. This guarantees that every model of Ad can be extended to a model of 
ICB by setting each non-Sig term to either ± or T. In turn, this property guarantees that 
any query whose signature is contained in Sig has the same answer in Ad and ICB. 

The function x-yio<l{Sig^ ICB) (lines 9-19), where x stands for T or _L, describes 
the procedure for constructing modules of a knowledge base ICB for each notion of 
locality. Starting with an empty set of axioms (line 11), iteratively, the axioms a that 
are non-local are added to the module (line|2]i and, in order to preserve soundness, the 
signature against which locality is checked is extended with the terms in a (line |2]l. 
Iteration stops when a fixpoint is reached. 

Modules based on a single syntactic locality can be further shrinked by iteratively 
nesting T-extraction into _L-extraction, thus obtaining T1.*-Mod{Sig, ICB) modules 
(lines |2]l2]l. 

The notions of module and locality must be extended to handle DIs, before we can 
apply them to VC^. Definition [T] generalizes the substitutions operated by the mod¬ 
ule extraction algorithm, abstracting away procedural details. As in ESI , both X and 
sig(X) denote the signature of X. 

Definition 1. (Module, locality) A T±*-substitution/or ICB and a signature Sig is a 
substitution a over ICB \ Sig that maps each concept name on T or _L, and every role 
name on the universal role or the empty role. A strong axiom a is cr-local iff a{a) is a 
tautology. A DIC 'Qn D is a-local iff C Q D is a-local. A set of axioms is a-local if 
all of its members are. We say that an axiom a is T -local (resp. Xlocal) if a is a-local 
where the substitution a uniformly maps concept names toT (resp. _L). 

A (syntactic) module of ICB with respect to Sig is a set A4 C ICB such that ICB \ A4 
is a-local for some T L*-substitution a for ICB and A4 U Sig. 

Let ModDi(<S'z 3 , ICB) be the variant of T_L*-Mod(5'ig, ICB) where the test in line 
|2]is replaced with (the complement of) the T or ±-locality condition of Def. [T](that 
covers DIs, too). Using the original correctness argument for T1.*-Mod{Sig, ICB) cf. 
ifTSl Prop. 42], it is easy to see that ModDi(<S'ig, ICB) returns a syntactic module of ICB 
w.r.t. Sig according to Def.[T] If ICB contains no DIs (i.e. it is classical), then Def. [1] is 
essentially a rephrasing of standard syntactic notions of modules and locality^ so 

for all queries a such that a C Sig, Ad |= a iff ICB ^ a. (34) 

However, proving that T±*-ModDi(5'ig, is correct fox full V£}^ is far from ob¬ 
vious; removing axioms from fCB using module extractors is incorrect under most 
nonmonotonic semantics (including circumscription, default logic and autoepistemic 
logic). The reason is that nonmonotonic inferences are more powerful than classical 

^ For efficiency, this test is approximated by a matching with a small set of templates. 

^ Informally, T_L*-Mod’s greedy strategy tends to find small Def.[T]s modules. 



Algorithm 2; T_L*-Mod(S'j5, KB) 

Input: Ontology KB, signature Sig 
Output: T_L*-module M of KB w.r.t. Sig 

// main 

1 begin 

2 M--KB 

3 repeat 

4 M'~M 

5 At := T-Mod(±-Mod(At, Sig), S't^) 

6 until At / At' 

7 return At 

8 end 


9 functionx-Modf/CB,^^^) // 2 ;€{A,T} 

10 begin 


11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 end 


At := 0, r := KB 

repeat 

changed = false 

forall a € T do 

if a is not x-local w.r.t. Sig U At then 
At := At U {a} 

r:=r\w 

changed = true 


until changed = false 

return At 


inferences, and the syntactic locality criterions illustrated above fail to capture some of 
the dependencies between different symbols. 

Example 6. Given the knowledge base {T AU 5} and Sig = {A}, the module 
extractor returns an empty module (because by setting B — T the only axiom in the 
KB becomes a tautology). The circumscription of this KB, assuming that both A and B 
are minimized, does not entail ACT, while the circumscription of the empty module 
entails it. 

Example 7. Under the stable model semantics, the normal logic program 

p^^p,q 
q -<r 
r ->q 

entails r, both credulously and skeptically. The module extractor, given Sig = {r}, 
removes the first rule (that becomes a tautology by setting p =true). The module does 
not entail r skeptically anymore, and erroneously entails q credulously. Analogues of 
this example apply to default and autoepistemic logic, using the usual translations. They 
can be adapted to default DL ^ and the extension of DL based on MKNF ifT^ . 











Now we illustrate the correct way of applying T ±*-ModDi to a VCJ^ ICB — SU'D 
and a query a (subsumption or assertion). Let E be the union of a and the set of 
normality concepts occurring in ICB. Let 

Mq = ModDi(r,/CBUNr), 

where NZ" abbreviates {NC C C \ NC £ E}. 

Example 8. Let ICB be the knowledge base: 

A U B (35) BnC U A (37) 

AUr^ DUE (36) F U„ A (38) 

and a the query NA C D. Af o is calculated as follows: first, since no normality concept 
occurs in fCB, E is equal to the signature 5 = {NA, D}. 

Algorithml^calls first the function ±-Mod(/CB U NZ,Z). Notice that by replacing 
C and F with _L, axioms (iJTl l and (l38l l becomes a tautology. Consequently, it is easy to 
see that the returned knowledge base is ICB' = 1 (1551) . (I36L NA C A}. 

Then, T-Mod is called on ICB' and E. Now, replacing B with T makes A U B a 
tautology, so the resulting knowledge base is ICB" = 1 (1551) . NA C A}. It is easy to see 
that a fix point is reached and hence ICB" is returned. 

We shall prove that {ICB D A4q)^ can be used in place of ICB^ to answer query a. 
This saves the cost of processing ICB^n \ A4, where 

M = (ICBo n Xo) U {5^*^ I <5 e O n Mo, NC e E}. 

Note that ICB^n \X is usually even larger than KB\Mo because for each DI (5 ^ Xo, 
all its translations (NC £ E) are removed from X. 

Lemma 1. X is a module oflCB^n w.r.t. E. 

Proof. Since ModDi(-, •) returns modules, {ICB U NZ) \ Xo is cr-local, for some 
T^‘-substitution a for K-B U NZ and Xo U Z. So, for all axioms /3 in /CSjf \ X, jS 
is cr-local (as KBq \ X C {tCB U NZ) \ Xo)- Moreover, for all /3 = <5^^ C | 
6 £ V, NC £ Z} \ X, it must hold S £ D \ Xo (by construction of X), and hence 
6 is cr-local. Now note that if a{E C F) is a tautology, then also cr(NZ7 FI F C F) is a 
tautology, therefore the cr-locality of 5 implies the cr-locality of It follows that all 
/3 in ICB^ii \ X are cr-local. 

Finally, note that sig(/Ci3f;;) = sig(/C6 U NZ) and X U Z = Xo U Z, therefore 
cr is also a T^‘-substitution for ICB^n and X U Z. It follows immediately that X is a 
module of /CSf); w.r.t. Z. □ 

Lemma 2. If M is a module of ICB w.r.t. a signature Sig and ICB' C ICB, then K-B' D 
X is a module of KB' w.r.t. Sig. 


Proof. If A4 is a module of ICB w.r.t. Sig, then 1CB \ is cr-local for some T_L*- 
substitution a for ICB and A4 U Sig. Let a' be the restriction of a to the symbols in 

Sig) ='i^'\ (sig(/C6' n Ad) U Sig). 

Clearly, cr' is a T_L*-substitution for )CB' and ICB \ (sig{ICB' fl Ad) U Sig). Moreover, 
for all /3 C ICB' \ Ad, cr(/3) = cr'(/3), by construction, so ICB' \ Ad is cr'-local. Then 
ICB' n Ad is a syntactic module of ICB' w.r.t. Sig. □ 

The relationship between (ICB fl Adp)"^ and ICB^ is: 

Lemma 3. ICB^ n Ad C (/CS n Adp)^ C ICB^ . 

Proof. It suffices to prove by induction that for all* = 0,1,..., |I?|0 

/CSf n Ad c (/C6 n Ado)f c ICBf . 

Base case (i = 0): 

K-Bq n Af = (5 u Nr) n M c (5nX)uNX' = 

= (5nAdo)uNi: = (ICBnMo)o C5uNi: = ICB^ . 

Induction step [i > 0): By induction hypothesis (IH) 

iCBf_^ n Ad c (/cs n Mo)fLi c K:Bf_^ . 

First suppose that Si ^ Adp (and hence for all NC, ^ Ad). Then ICB^ fl Ad = 
ICBf_^ n Ad, (ICB n Adp)f = (ICB n Adp),^i, and (by def.) ICBf_i C ICBf. The 
Lemma follows by IH. 

Next assume that G Adp and let NC be any normality concept in S. Note that 
G Ad. By IH, 

(ICBti n M) U{df C (ICB n Afo)f-i C ICBf_i Us, u{5r‘='} . 

The leftmost term equals (ICBf_^ fl Ad C ICBfn, so by Lemmas[T]and 

|2]and (l34l l. the leftmost term entails NC L _L iff the rightmost does. It follows that the 
middle term (ICB fl A4())f_^ entails NC L _L iff the other two terms do. 

Then, 5)^'" is added to (ICB fl Adp)f' iff belongs to ICBf fl Ad and ICBf, and the 
Lemma follows using the IH. □ 

As a consequence, the modularized construction is correct: 

Theorem 1. (ICB n Adp)^ |= a ifflCB^ h «• 

Proof. By Lemmas [T] and |2] and (l34l l. fCB^ |= a iff ICB^ fl Ad ^ a. The Theorem 
then follows by Lemma [3 □ 

Here the sets (KB n A4o)f are defined by replacing S with S fl Ado, and KB with KB n Ado 
in (0 and 0, while Si ranges over all the DIs in KB, not just n Ado. This formulation 
facilitates the comparison with KB^ . By the condition Si G KB in 0 for all Si (f Mo, 
(KB n Mo)f = (KB n Mo)f-i, so this definition is equivalent to building (KB n Ado)^ 
using only 2? n Ado. 






4 Optimistic computation 


The construction of K-B^ repeats the concept consistency check Q over a sequence 
of knowledge bases U that share a (possibly large) common part 

JCBq , so incremental reasoning mechanisms help by avoiding multiple computations of 
the consequences of ICBq . On the contrary, the set of may change significantly at 
each step due to the filtering 4 ,^ 5 ^. This operation requires many axiom deletions, which 
as already highlighted in fT9l , are less efficient than monotonically increasing changes. 
The optimistic algorithm introduced here (Algorithm |2l computes a knowledge base 
ICB* equivalent to ICB^ in a way that tends to reduce the number of deletions, as it will 
be assessed in Sec.|5] 

Phase 1 optimistically assumes that the DIs with the same priority as do not 
contribute to entailing NC C _L in 0, so they are not filtered with in line[2 Phase 2 
checks whether the DIs discarded during Phase 1 are actually overridden by applying 
4 , 5 ^ (lines |3] and 0. DIs are processed in non-increasing priority order as much as pos¬ 
sible (cf. Iine0 so as to exploit monotonic incremental classifications. 

The following theorem shows the correctness of Alghorithm0in case the normality 
concepts do not occur in ]CB, but only in the queries. We call such knowledge bases 
N-free. It is worth noting that the optimistic method is not generally correct when ICB 
is not N-free and 1271 > 1, yet it may still be applicable after the module extractor if the 
latter removes all normality concepts from ICB. 

Theorem 2. If ICB is N-free, then Algorithm's output is equivalent to ICB^. 

Proof First assume that 27 is a singleton {NC}. We start by proving some invariants 
of lines 6-10. 

Claim 1; ICB^ |= 7T. 

Claim 2; If, for some j < i, Sf^ G ICB^ \ 77, then ICB^ |= NC C -L. 

We prove these two claims simultaneously. Both claims hold vacuously at the first 
execution of line 6. Next, assume by induction hypothesis that they hold at line 6 in 
some iteration; we have to prove that they still hold at the next iteration. There are 
two possibilities: First suppose that for some j < i, € ICB^ \ 77. By Claim 2, 
ICB^ \= NC C _L. This immediately implies that Claim 2 holds also at the next iter¬ 
ation. Moreover, it implies Claim 1 because all members of 77 have an occurrence of 
NC in the left-hand side, cf. 0. 

We are left with the case in which 

for all j < i, if e ICB^ then G 77. (39) 

If the condition in line0is true, then 77 is not changed, so Claim 1 must hold at the next 
iteration. Otherwise, by (|39] |, 

KB^ U 77' D KBf_^ { 5 , , (40) 

and hence NC C _L is not provable in 0, either. It follows that belongs to both 77 
(by line0 and ICB^ (by 0). This proves Claim 1 for iteration i. 


Concerning Claim 2, first suppose that the condition in line |3] is true; then either 
Claim 2 remains vacuously satisfied, or G ICB^ \ 11. The latter (plus the def. 
of and the ind. hyp. of Claim 1) implies that ICB^ U U' is entailed by KB^. It 
follows that JCB^ NC C _L as well, which proves Claim 2 in this case. Finally, if 
the condition in line[3]is false, then at line[3]5]^‘^ G 77. Together with (IWt , this implies 
that Claim 2 holds vacuously. 

Claim 3; If KB^ |= NC □ 1 then tZB* h NC C -L. 

Suppose not (we shall derive a contradiction). The assumption and Claim 1 imply 
that there must be some <5^*^ G ICB^ \ KB*. Let be the one with minimal k. Using 
minimality, it can be proved that 77 isi= KBf_i 4 . 5 ., so the concept consistency 

tests in lines|3]and[3(the latter instantiated with j = i and D = C) are equivalent to the 
one in (|5]l. But then G KB^ iff G ICB*, which contradicts the assumption, so 
Claim 3 is proved. 

Claim 4: If KB^ |= NC C -L then KB* = KB^. 

Note that KB* C /CSf UTTUlNC C -L} (cf. lines 11,15, and 22). Clearly, KB^ h 
KBq U 77 U {NC C _L} (by def.. Claim 1 and the assumption), so KB^ \= KB*. We 
are left to prove KB* ^ KB^. By Claim 3, KB* ^ NC C _L, and this inclusion in 
turn entails all G KB^ (cf. (|4|i). The other members of KB^ are those in KBq , by 
definition, and KB* D KBq (line 11). It follows that KB* |= KB^, which completes 
the proof of Claim 4. 

Claim 5; If KB^ ^ NC C -L then KB* = KB^. 

Suppose that KB^ ^ NC C _L. Then, by the contrapositive of Claim 2 and 
Claim 1, KBq U 77 = KB^, which further implies that the concept consistency tests in 
lines |3]and|3]are equivalent to the corresponding test in (|5]l. Then it can be proved that if 
any of these tests were true, then also KB^ |= NC C _L because, by TT’s construction, 
in that case 5i must be in conflict with some other DI with the same priority. However, 
KB^ ^ NC C _L contradicts the assumption. It follows that all tests in lines |3] and [3] 
are false, so KB* = KBq U 77, and we have already argued that this knowledge base 
is equivalent to KB^. This completes the proof for |i7| = 1. 

For |i7| > 1, note that the tests in lines[3]and[3(resp.[2l do not depend on any <5^^ 
such that E ^ C (resp. E ^ 77). Indeed, by E-locality, all such <5^^ are local w.r.t. the 
signature of KB U {NC C _L}, so they can be removed without changing the result of 
the concept consistency test ll26l . However, after their removal, the concept consistency 
tests correpond to the ones for the singleton case E = {NC}, which we have already 
proved correct. □ 


5 Experimental assessment 

Currently there are no “real” KBs encoded in a nonmonotonic DL, because standard DL 
technology does not support nonmonotonic reasoning. The nonmonotonic KBs encoded 
in the hybrid rule+DL system DLV-Hex lfT3l are not suited to our purposes because they 
do not feature default inheritance due to a restriction of the language: DL predicates 
cannot occur in rule heads, so rules cannot be used for encoding default inheritance. 
Consequently, synthetic test cases are the only choice for evaluating our algorithms. We 


Algorithm 3: Optimistic-Method 

Input: K.B — S\JV, E 

Output: a knowledge base KB* such that KB* = KB^ 

/ / Phase 1 

1 compute a linearization (5i,. .., 5\x>\ of 2? 

2 27 := 0 // 72 collects the prototypes 

3 Zi := 0 // ordered list of all discarded 

4 for i = 1, 2, ..., \T>\ do 

5 for NO € 72 do 

6 27' := 27 U {(5f 

7 if/CBf U27' ^NCClthen 

8 I 27 := 27' 

9 else 

10 append to A 

/ / Phase 2 

11 KB* = KBi^ U 27 

12 while Z\ / 0 do 

13 extract from A its first element 

14 if {KB^ U 27) ^ NC C _L then 

15 KB* ■- KB* U {NO C _L} 

16 extract all <5^^ with E = C from A 

17 else 

// is actually overridden 

18 & := 5i 

19 while A contains some Sf^ such that 5 -< 5j do 

20 extract from A the first such 5f^ 

21 if (KB^ U 27) 4,^5^ U{(5f ND C _L then 

22 KB* := KB* U {N72 C _L} 

23 extract all <5^^ with E = D from A 

24 S := Sj 




start with the two test suites introduced in Q as they have been proved to be nontrivial 
w.r.t. a number of structural parameters, including nonclassical features like exception 
levels and the amount of overriding. The two test suites are obtained by modifying 
the popular Gene Ontology (GOjl, which contains 20465 atomic concepts and 28896 
concept inclusions. In one test suite, randomly selected axioms of GO are turned into 
DIs, while in the second suite random synthetic DIs are injected in GO. The amount 
of strong axioms transformed into DIs is controlled by CI-to-DI-rate, expressed as the 
percentage of transformed axioms w.r.t. |GO| while the amount of additional synthetic 
DIs is controlled by Synthetic-DI-rate, i.e. the ratio |I?|/|GO|. The number of conflicts 
between DIs can be increased by adding an amount of random disjointness axioms 
specified by parameter DA-rate (see Q for further details). 

The experiments were performed on an Intel Core i7 2,5GHz laptop with 16 GB 
RAM and OS X 10.10.1, using Java 1.7 configured with 8 GB RAM and 3 GB stack 
space. Each reported value is the average execution time over ten nonmonotonic on¬ 
tologies and fifty queries on each ontology. For each parameter setting, we report the 
execution time of: (i) the naive VC^ reasoner of Q; (ii) the optimistic method intro¬ 
duced in Sec. |4] (Opt); (in) the module extraction method of Sec. [3 (Mod) using the 
module extraction facility of the OWLAPI; (iv) the sequential execution of Mod and 
Opt, i.e. Algorithm |3] is applied to ICB fl AJq- This combined method is correct by 
Theorem|2]and Theorem[T] 


CI-to-DI 

naive 

opt 

mod 

mod-l-opt 

05% 

12.91 

05.93 

00.30 

00.25 

10% 

22.37 

11.13 

00.32 

00.27 

15% 

31.50 

15.90 

00.37 

00.32 

20% 

42.97 

20.67 

00.40 

00.33 

25% 

55.22 

25.17 

00.44 

00.36 


Synth DIs 

naive 

opt 

mod 

mod-l-opt 

05% 

11.64 

06.94 

0.41 

0.42 

10% 

21.66 

11.21 

0.62 

0.67 

15% 

32.80 

14.90 

1.11 

1.64 

20% 

41.51 

18.82 

2.01 

1.42 

25% 

51.85 

22.33 

3.05 

2.09 


Table 2. Impact of \T>\ on performance (sec) - DA rate = 15% - priority lUll 


Table |2] shows the impact of the number of DIs on response time for the two test 
suites, as DI rate ranges from 5% to 25%. The methods Mod and Mod-nOpt are slightly 
less effective in the second suite probably because random defaults connect unrelated 
parts of the ontology, thereby hindering module extraction. In both suites, Opt’s speedup 
factor (w.r.t. the naive method) is about two, while on average Mod is approximately 
87 times faster in the first test suite (max. speedup 125), and 28 times faster in the 
second (max. speedup 35). On average, the combined method yields a further 13% 
improvement over Mod alone; the maximum reduction is 31% (2“'^ suite, Synthetic- 
DI-rate=25%, DA-rate=15%). The additional conflicts induced by injected disjointness 
axioms have moderate effects on response time (cf. Table O. Mod-nOpt’s average re¬ 
sponse time across both test suites is 0.7 sec., and the longest Mod-nOpt response time 


^ http://www.geneontology.org 



















has been 2.09 sec. As a term of comparison, a single classification of the original GO 
takes approximately 0.4 seconds. 


Test suite 1 (CTto-DI) 


DA 

naive 

opt 

mod 

mod-l-opt 

05% 

29.88 

13.21 

0.36 

0.31 

10% 

32.96 

14.08 

0.37 

0.32 

15% 

31.50 

15.90 

0.37 

0.32 

20% 

34.23 

16.23 

0.39 

0.33 

25% 

36.47 

17.80 

0.40 

0.34 

30% 

37.71 

18.09 

0.40 

0.34 


Test suite 2 (Synth. DIs) 


DA 

naive 

opt 

mod 

mod-l-opt 

05% 

28.20 

12.63 

0.99 

0.84 

10% 

30.18 

13.68 

1.04 

0.97 

15% 

32.80 

14.90 

1.11 

1.06 

20% 

35.68 

16.29 

1.18 

1.10 

25% 

37.46 

17.02 

1.25 

1.15 

30% 

38.37 

18.79 

1.36 

1.23 


Table 3. Impact of DAs on performance (sec) - DI rate = 15% - priority O 


CI-to-DI 

naive 

opt 

mod 

mod-l-opt 

05% 

22.01 

05.74 

00.30 

00.25 

10% 

52.82 

11.48 

00.32 

00.28 

15% 

81.84 

16.56 

00.34 

00.31 

20% 

133.62 

20.51 

00.38 

00.33 

25% 

193.27 

26.42 

00.41 

00.36 


Synth DIs 

naive 

opt 

mod 

mod-l-opt 

05% 

12.76 

07.21 

0.45 

0.46 

10% 

23.72 

14.44 

0.81 

0.86 

15% 

34.53 

17.05 

1.57 

1.21 

20% 

44.92 

21.77 

2.67 

1.96 

25% 

55.92 

25.77 

3.87 

2.46 


Table 4. Impact of |22| on performance (sec) - DA rate = 15% - priority ^ 


Table|4]is the analogue of Table|2]given priority (|2|l. With respect to priority ([T]|, the 
computation time for JCB^ and query answering in the first test suite grows faster for 
the naive algorithm, while there are smaller differences for the optimized approaches 
(the reponse times of the combined approach are almost identical). In the second test 
suite, the performance of the naive algorithms decreases less dramatically, while the 
optimized methods seem slightly less effective than in the first test suite. In all cases, 
the speedups of Mod and Mod-Opt remain well above one order of magnitude. The 
performance as DAs grow has similar features (see Tabled. 

The above test sets are N-free. We carried out a new set of experiments by randomly 
introducing normality concepts in DIs, within the scope of quantifiers]^ Specifically 
3R.C is transformed into 3i?.N C. The response times of the naive algorithm and MocO 
under priority ([T]i are listed in Table|6]for increasing values of 1271 (that is directly related 
to the amount of normality concepts occurring in K.E). We estimate that the values 
of 1271 considered here are larger than what should be expected in practice, given the 
specific role of explicit normality concepts, cf. footnote |6] Such values are also much 

® So far, all the applicative examples that are not N-free satisfy this restriction, as apparently 
the only purpose of explicit normality concepts is restricting default role ranges to normal 
individuals, cf. Ex. 12 and the nomonotonic design pattern in Gl Sec. 3.3]. 

’ In this setting Opt and Mod-l-Opt are not applicable, in general. 



































Test suite 1 (CI-to-DI) 


DA 

naive 

opt 

mod 

mod-l-opt 

05% 

84.53 

15.02 

0.34 

0.29 

10% 

90.38 

16.12 

0.35 

0.30 

15% 

91.84 

16.56 

0.35 

0.31 

20% 

92.93 

16.67 

0.36 

0.31 

25% 

93.54 

17.76 

0.37 

0.32 

30% 

96.37 

19.49 

0.38 

0.33 


Test suite 2 (Synth. DIs) 


DA 

naive 

opt 

mod 

mod-l-opt 

05% 

29.55 

14.93 

1.28 

1.07 

10% 

30.81 

15.82 

1.41 

1.15 

15% 

34.54 

17.05 

1.57 

1.21 

20% 

36.79 

16.93 

1.62 

1.27 

25% 

40.86 

17.90 

1.78 

1.36 

30% 

43.35 

18.74 

1.79 

1.34 


Table 5. Impact of DAs on performance (sec) - DI rate = 15% - priority © 


larger than in N-free experiments, where |27| is bounded by the query size. Response 
times increase accordingly. In most cases, the naive algorithm exceeded the timeout. 
In the first test suite. Mod remains well below 1 minute; in the second suite it ranges 
between 100 seconds and 10 minutes. The reason of the higher computation times in 
the second suite is that the extracted modules are significantly larger, probably due to 
the random dependencies between concept names introduced by fully synthetic DIs. 


1^1 

50 

100 

150 

200 

250 

Test suite 1 

naive 

mod 

1794.37 

2.31 

>30 min. 

7.26 

>30 min. 

14.77 

>30 min. 

25.32 

>30 min. 

39.22 

Test suite 2 

naive 

mod 

>30 min. 

103.4 

>30 min. 

211.5 

>30 min. 

327.4 

>30 min. 

459.2 

>30 min. 
586.7 


Table 6. Impact of normal roles values (sec) - DI rate = 25% DA rate = 15% 


6 Conclusions 

The module-based and optimistic optimizations introduced here are sound and com¬ 
plete, where the later applies only if the knowledge base is N-free. In our experiments, 
the combined method (when applicable) and the module-based method make rea¬ 
soning at least one order of magnitude faster (and up to ~780 times faster in some 
case). In most cases, optimized reasoning is compatible with real time reasoning. 
This is the first time such performance is reached over nonmonotonic KBs of this size; 
more than 20K concept names and over 30K inclusions 0 Our approach brings technol¬ 
ogy closer to practical nonmonotonic reasoning with very large KBs. Only the random 
dependencies introduced by synthetic DIs, combined with numerous restrictions of role 
ranges to normal individuals, can raise response time over 40 seconds; in most of the 
other cases, computation time remains below 2 seconds. 

* Good results have been obtained also for KBs with ~5200 inclusions under rational closure 
semantics I9I8I . 








































We are currently exploring a more aggressive module extraction approach, capable 
of eliminating some of the normality concepts in E and related axioms. Besides im¬ 
proving performance over non-N-free KBs, a more powerful module extractor might 
enable the application of the combined Mod-rOpt method to non-N-free VC^ knowl¬ 
edge bases, by removing all normality concepts from K.B before Opt is applied. 

We are also planning to adopt a different module extractor ||23| that is promising to 
be faster than the OWLAPI implementation. 

Last but not least, we are progressively extending the set of experiments by covering 
the missing cases and by widening the benchmark set, using real ontologies different 
from GO as well as thoroughly synthetic ontologies. 
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